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ABSTRACT 

This study is devoted to Big Data problems in 
biometric technologies. Before the traditional 
database were not capable to handle the large amount 
of data, so we highlight the concept of using Big Data 
technology in solution of a wide range of issues faced 
by the national security, law enforcement system and 
defence authorities. The study provides information 
about Big Data in the field of biometric technology. 

Keyword: Biometrics, Big Data, Recognition, 
Hadoop. 

INTRODUCTION 
> Big Data 

Big data means really a big data; it is a collection of 
large datasets that cannot be processed using 
traditional computing techniques or it is set of data 
that are voluminous and complex data. Big Data 
challenges include capturing data, data storage, data 
analysis, search, sharing, transfer, visualization, 
querying, updating, and information privacy 

The main purpose of Big Data is to improve 
efficiency for using the large volume of data. If the 
Big Data is used in proper manner than organizations 
can pursue new business model and can achieve 
competitive advantages. 

It includes huge volume, high velocity, and extensible 
variety of data. 

The data in it can be of three types: 

1. Structured data: It can be Relational Data. 

2. Semi Structured data: They are in form of XML 
data. 

3. Unstructured data: It consists of Word, PDF, Text, 
Media files. 


> Characteristics of Big Data 
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Figure 1. Characteristics of Big Data [13] 

1. Volume: 

When talking about the Volume, it refers to the 
insanely large amount of data or quantity of data. 

2. Variety: 

It refers to the different data types, degree of 
structured, semi-structure and unstructured data which 
doesn’t fit easily into database application. 

3. Velocity: 

It refers to the speed of data how fast the data are 
generated every second. Example: Google, Facebook. 

4. Veracity: 

It refers to the trustworthiness of the data in terms of 
accuracy. 

5. Value: 

It means having large amount of data is good, but it is 
useless if it can’t be turned into values. 

> Advantages of Big Data 

1. Using the information kept in the social network 
the marketing agencies are learning about the 
response for the campaigns, promotions, and other 
advertising medium. Ex:-Instagram. 

2. Companies are allowed to store more transactional 
data in digital format. 

3. Using the data regarding the pervious medical 
history patients, hospital are providing better and 
quick service. 
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4. Some grate benefits of Big Data are cost 
reduction, Real time website customization, swift 
- better decision making. 

> Biometric Recognition 

It is a technology used to identify, analyze and 
measure an individual’s physical and behavioural 
characteristics. Those data are captured from the 
human body and are compared to the data that are 
stored in set of database. 

The common biometric authentication techniques 
used is Finger prints biometric authentication. Some 
other techniques which can be used are Face 
recognition, Iris Scan, voice Recognition. Today 
biometrics system have been implemented in various 
commercial, civilian and forensic application as a 
means of establishing identity. 

Biometrics is used for authenticating and authorizing 
a person. Authentication of a person through 
biometric is highly challenging. It is used in forensics, 
secured access, prison security, medical, and robotics 
areas financial services, ecommerce, 
telecommunication, government, traffic and many 
more. New technology or technique can be used is ear 
recognition of individuals. 

Biometrics authentication has two types of Model for 
Authentication Process: 

1. Unimodal 

Unimodal biometrics refers to the use of only one 
biometrics modalities in an identification system. 
Example: Fingerprint recognition 

2. Multimodal 

Multimodal biometrics refers to the use of a 
combination of two or more biometrics modalities in 
an identification system 
Example: Face recognition and Iris scan. 



LITERATURE REVIEW: 


In Paper [6],It gives the importance of using big data 
in order to support decision for the strategies and they 
also chose to invest more in processing larger data 
sets than investing in algorithms which are expensive. 
It also states that working with large amount of data is 
a challenging work but it provides better output. 

The paper also provides the brief introduction to the 
Hadoop technology with its computing solutions. It 
also uses the concept of multi-node Hadoop cluster 
and its importance. It provides the concept of big data 
and the technologies involved to understand the new 
technology. 



In paper, it provide new different techniques to store 
the data. In Big Data provides three main 
characteristics high volume, high velocity & high 
variety. It is cost effective in storing the data and 
helps in better decision making. 

Now a day's, it helps in securing locations and other 
data. It also represents the concept of multi model 
biometric data through big data. It also have an brief 
idea about the Unimodal biometric system. The 
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biometrics devices typically used are hand geometry 
reader and fingerprint authentication. 

In Paper[8], discussed about the big data and big data 
challenges in biometrics. It highlights the use of big 
data technologies. It also provides characteristics of 
the big data and types of data which can be generated- 
structured, semi structured unstructured data. It 
provides the information about the biometrics and 
problems that are occurred in biometric technologies. 

Brief discussion about the cloud technologies 
regarding how the cloud technology can be used to 
recognize the person’s characteristics. They are 
working on developing advance big data capabilities 
to prevent from terrorism and criminal investigation. 

In Paper [9], Discussed about the Unimodal and 
multi-modal biometric system. Multi-modal provides 
the reasons for reducing the rates. Levels of Fusion 
used in multimodal biometric system. It also provides 
the comparison between the Unimodal and 
multimodal and limitations of Unimodal can be 
overcome by the multimodal. 

In paper [2], The discussion is done about the ear 
recognition system. It is the currently used technology 
in biometric identification of human and it is quite 
complex task. It also provides the brief information 
about the model used for biometric authentication- 
Unimodal and multimodal. 

It also provides discussion about the existing 
techniques of human Authentication and identification 
- fingerprints (overview of fingerprint recognition 
with its operation, limitation and area of application 
and what instruments are required to identify the 
fingerprints.) same for face recognition and Iris scan. 
Main it provides how the ear recognition is done and 
the usefulness of ear based biometric identification 
system. 

In Paper [10], Multimode biometrics is suitable for 
mobile, PDAs, smart phones. They are portable 
subject to loss and theft. The use of enhanced security 
mechanism as biometric authentication. The use of 
teeth image and voice authentication. It is fact that the 
military deploys largest use of mobile devices of 
which are embedded system with tactics and military 
intelligence. Authentication to such by teeth image 
and voice, it will become difficult to the security of 
such device in the event of its loss or misplace. 


In Paper [11],Discussion about multimodal biometrics 
application and research area. It is used to combine 
two or more different sources of person sensed by 
different sensors. It include border management, 
interface for criminal and civil applications, and first 
responder verification. System can be improved by 
enhancing matching algorithms, integration of 
multiple sensor, and analysis of the scalability of 
biometrics system. 

Signature recognition requires little space to store the 
data. A disadvantage for signature recognition is that 
individuals who do not sign their names in a 
consistent manner may have difficulty enrolling and 
verifying in signature verification. Iris scan is 
believed to be highly stable over lifetime and uses an 
infrared imager to illuminate the eye and capture a 
very high-resolution image. Iris scan is difficult to 
capture for some individuals. In voice recognition 
sound sensations of a person is measured and 
compared to an existing dataset. Voice recognition 
Difficult to control sensor and channel variances that 
significantly impact. 

PROPOSED MODEL 

The multi model biometric data handling through big 
data framework is proposed. The multi model 
biometric framework first reads biometric (image, 
video, sound, etc.) data from user. Once the data is 
collected from the user the data is been entered into 
application framework. It identifies the biometric type 
and converts those data into binary form, same 
procedure is followed for other biometric data. The 
converted data are been merged into single biometric 
detail by using the algorithm. The binary data are now 
encrypted using the encryption algorithm. 

The authentication process, done when the user comes 
for biometric authentication, the multi model 
biometric framework reads the data of user than 
identify the type of biometric data and converts the 
data into binary data. Binary data is encrypted and 
those encrypted data is compared to the biometric 
data. If the compared data is matched with the data in 
database than it retrieves the binary data. After 
extracting binary data from big data Hadoop 
framework API, data will be decrypted and convert 
into original data form and user will get the real time 
authentication though multi model biometric 
framework. 


@ IJTSRD I AvailableOnline@www.ijtsrd.coml Volume-2 I Issue-6 ISep-Oct2018 


Page: 1275 



International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470 


Voice fecojnilijo 

HP 

y 


hpui 


DataEnavpikin 


Flnjerprirfct PecogTiitian 

tail 





. . . ilmiiF-r"■«- 

Wot 

Ftopruring 

* fanned 



Betirra Bec<jgnifi&n 


Input 



llshg Hadojp Framework 


fppfcalicfi 

Iro^armng' 

Frarr^woTk Get Rm! tir £ 

Data 


User 


fiabifecrypKbn 

Figure 7 Proposed Model of Hadoop [20] 


3.2 Map - Reduce Algorithm: 

It is a distributed data processing algorithm, useful for 
processing huge amount of data in cluster 
environment. 

This algorithm uses the concept of divide and conquer 
algorithm to process large amount of data. 

The input is divided into smaller and manageable sub¬ 
tasks to execute them parallel. 

This algorithm uses three main steps: 

1. Map Function 

2. Shuffle Function 

3. Reduce Function 


PROPOSED ALGORITHM: 

> K-means clustering algorithm: 

This algorithm is used for creating and analyzing the 
clusters. ‘N’ number of data points are divided into 
‘k’ cluster. Results generated from the algorithm is 
dependent on choosing the initial cluster centroid. 

The main feature, if data is arranged into a matrix, 
with row representing data points and column 
representing attributes of each data points, than matrix 
might not have defined values at many places. 

Algorithm: 

All data points have maximum M dimension which 
can be represented as row of M values (vl, v2...., 
vM).Select any m dimension from the data points. 
Arrange them in descending order of priority as, (dl, 
d2..., dm).The first dimension is known as primary 
dimension and others are called secondary dimension. 
Number of cluster is pre-decided-‘k’ which is an input 
to the algorithm. 

Step 1: the variation of each dimension is 
considered as, (max-min)/k, 1<= i<=m. 
where max is the maximum value of i 
dimension and the min is the minimum value 
of i dimension. 

Step 2: Initial cluster are formed using following 
conditions for data point, if mini + j * $1 <= 
vail <mil + (j+1) * $1, the data point 
belongs to cluster j. 

Step 3: Centroid of each cluster is computed means 
of all cluster points. 

Step 4: Every secondary dimension, 2<j<m,repeat 
the same. 


Algorithm: 

Map function: In this step, it takes input tasks and 
divide the task into smaller sub-tasks. It follows two 
sub-steps: 

1. Splitting: takes input and divide into smaller sub¬ 
datasets. 

2. Mapping: takes smaller sub-datasets and perform 
action on each sub-datasets. 

Shuffle function: In this step, also known as combine 
phase. It performs two sub-steps: 

1. Merging: combines all key-value pairs which have 
same key. 

2. Sorting: takes the input from merging step and 
sort all key-value pair by using keys. 

Reduce function: In this step, it takes the list of 
sorted key-value pairs from shuffle function and 
perform reduce operation. 



Figure8. Architecture of Map - Reduce [18] 

Conclusion and Future work: 

The literature survey done in this paper provides a 
new insight in different technique to store data into 
big data in the field of biometrics recognition. Big 
data provides high volume, high velocity and high 
variety of information. Big data provides the cost 
effective framework to store data. The scale of the 
biometric data to ensuring the integrity of biometric 
data by elimination duplicate records. 
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Today, the primary application of biometrics is in 
physical security: to control access to secure locations 
(rooms or buildings). Biometric system which rely on 
the evidence of multiple sources of information for 
establishing identity are called Multimodal biometric 
system. This paper presents an overview of 
multimodal biometrics data handling through big data. 
It also discuss their applications to develop the 
security system for high security areas. 

A lot of research work is still need in this area. In near 
future multi model biometric data handling can apply 
to enhance the security, real time response and cost 
cutting approach of our system. 
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